National Repository of Grey Literature 11 records found  1 - 10next  jump to record: Search took 0.01 seconds. 
Methods of Data Extraction from the Web
Perina, Lukáš ; Křivka, Zbyněk (referee) ; Burget, Radek (advisor)
The purpose of this bachelor thesis is to design an architecture and subsequent implementation of an application designed for data extraction (web scraping) from web documents. Unlike conventional methods, it is an extraction based on defining data types and regular expressions of requested elements. Extraction is executed in such a manner, where it is not necessary to know the detailed structure of given web document and the possibility of using just one definition to detect requested elements on different web pages. Algorithm is able to achieve overall accuracy of 85,51% and recall 80,28%. This approach can reduce the time required for analysis of web pages significantly and not to take the structure of the code as a determining factor while creating web scraping requests.
Level scraping systems of settlement tanks
Rusník, Tomáš ; Hort, Filip (referee) ; Brandejs, Jan (advisor)
Main topic of this bachelor thesis is analyse of settlement tanks scraping equipements in sewage plants. Settlement tank at the wastewater treatment plant is part of the so-called bio-lines, which is used for the separation of activated sludge from waste water. Activated sludge settles in the settling zone. Part of the sludge flotates to the surface of the tank, usually in the space between the inflow cylinder and overflow edge, respectively. This sludge, biological or chemical origin, may, at the massive occurrence, outflow to the recipient and causes worse quality of the effluent. The sludge is therefore necessary to remove from surface of the settlement tank. Some of the different systems of sludge collection have been developer during last decades. The equipments and their function are descriped in my work.
Automated Retrieval of Information from the WWW
Žabka, Andrej ; Bartík, Vladimír (referee) ; Burget, Radek (advisor)
This bachelor thesis deals with data extraction from web (web scraping) and displaying this data. The created tool allows it's user to quickly and simply create a project, that can extract data from multiple web sites and display them in a user friendly fashion. The thesis also contains examples, that showcase the abilities of this tool and were used in it's testing.
A Service for Verification of Czech Attorneys
Jílek, Radim ; Glembek, Ondřej (referee) ; Szőke, Igor (advisor)
This thesis deals with the design and implementation of the Internet service, which allows to objectively assess and verify the reliability and diligence of Czech lawyers based on publicly available data of several courts. The aim of the thesis is to create and put into operation this service. The result of the work are the programs that provide partial actions in the realization of this intention.
Automated Retrieval of Information from the WWW
Žabka, Andrej ; Bartík, Vladimír (referee) ; Burget, Radek (advisor)
This bachelor thesis deals with data extraction from web (web scraping) and displaying this data. The created tool allows it's user to quickly and simply create a project, that can extract data from multiple web sites and display them in a user friendly fashion. The thesis also contains examples, that showcase the abilities of this tool and were used in it's testing.
Methods of Data Extraction from the Web
Perina, Lukáš ; Křivka, Zbyněk (referee) ; Burget, Radek (advisor)
The purpose of this bachelor thesis is to design an architecture and subsequent implementation of an application designed for data extraction (web scraping) from web documents. Unlike conventional methods, it is an extraction based on defining data types and regular expressions of requested elements. Extraction is executed in such a manner, where it is not necessary to know the detailed structure of given web document and the possibility of using just one definition to detect requested elements on different web pages. Algorithm is able to achieve overall accuracy of 85,51% and recall 80,28%. This approach can reduce the time required for analysis of web pages significantly and not to take the structure of the code as a determining factor while creating web scraping requests.
Analýza textů uživatelských recenzí plaveckých bazénů
Dragolovová, Anna
The work focuses on identification of most frequently commented topics in swimming pools user reviews. User reviews have been scrapped from Google review pages, preprocessed to text mining and machine learning compatible format, vectorized by bag of words and word embeddings approaches and analyzed by topic modelling and cluster analysis. Twenty‐two relevant topics indicating swiming pool management priorities have been found as a result.
A Service for Verification of Czech Attorneys
Jílek, Radim ; Glembek, Ondřej (referee) ; Szőke, Igor (advisor)
This thesis deals with the design and implementation of the Internet service, which allows to objectively assess and verify the reliability and diligence of Czech lawyers based on publicly available data of several courts. The aim of the thesis is to create and put into operation this service. The result of the work are the programs that provide partial actions in the realization of this intention.
Proposal of Part of Company's IS for Monitoring Competitors' Prices
Jobko, Viliam ; Šlosár, Peter (referee) ; Luhan, Jan (advisor)
This bachelor’s thesis is focused on the design of information system for monitoring of competitors’ products and their prices. Its main objective is to design regular collection of these information, their processing and storage. The obtained knowledge will be used to set better pricing policy.
Level scraping systems of settlement tanks
Rusník, Tomáš ; Hort, Filip (referee) ; Brandejs, Jan (advisor)
Main topic of this bachelor thesis is analyse of settlement tanks scraping equipements in sewage plants. Settlement tank at the wastewater treatment plant is part of the so-called bio-lines, which is used for the separation of activated sludge from waste water. Activated sludge settles in the settling zone. Part of the sludge flotates to the surface of the tank, usually in the space between the inflow cylinder and overflow edge, respectively. This sludge, biological or chemical origin, may, at the massive occurrence, outflow to the recipient and causes worse quality of the effluent. The sludge is therefore necessary to remove from surface of the settlement tank. Some of the different systems of sludge collection have been developer during last decades. The equipments and their function are descriped in my work.

National Repository of Grey Literature : 11 records found   1 - 10next  jump to record:
Interested in being notified about new results for this query?
Subscribe to the RSS feed.